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Abstract 


Dempster-Shafer evidence theory has been widely used in various applications. However, 
to solve the problem of counter-intuitive outcomes by using classical Dempster-Shafer com- 
bination rule is still an open issue while fusing the conflicting evidences. Many approaches 
based on discounted evidence and weighted average evidence have been investigated and 
have made significant improvements. Nevertheless, all of these approaches have inherent 
flaws. In this paper, a new weighting factor is proposed to address this problem. First, a 
modified dissimilarity measurement is proposed which is characterized by both distance 
and conflict between evidences. Second, a measurement of information volume of each evi- 
dence based on Deng entropy is introduced. Then two kinds of weight derived from afore- 
mentioned measurement are combined to obtain a new weighting factor and a weighted 
average method based on the new weighting factor is proposed. Numerical examples are 
used to illustrate the validity and effectiveness of the proposed method. In the end, the new 
method is applied to a real-life application of river water quality monitoring, which effectively 
identify the major land use activities contributing to river pollution. 


Introduction 


Dempster-Shafer (D-S) evidence theory provides a reasonable and efficient way to deal with 
the information which is uncertain and discordant. It has been extensively used in various 
applications related to decision-making such as information fusion [1-3], uncertain reasoning 
[4], fault diagnosis [5], risk analysis [6-9], cognitive map [10], target recognition and associa- 
tion [11-13]. Unlike the probability theory and Bayesian theory, the D-S evidence theory 
requires few prior conditions and knowledge when information is processed. For example, the 
Evidential Reasoning (ER) algorithm is a generalized Bayesian inference process and the ER 
rule reveals that the combined degree of joint support for a proposition from two pieces of 
independent evidence constitutes two parts in general [14-16]. When there is no priori infor- 
mation, the ER rule will reduce to the D-S combination rule. Moreover, the combination rule 
of the D-S evidence theory satisfies some of mathematical properties, such as commutativity 
and associativity. However, counter-intuitive results may occurred by the normalization step 
of the classical D-S combination rule when collected sources of evidence highly conflict with 
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each other, as pointed out by Zadeh [17]. The effectiveness of the D-S evidence theory will be 
considerably reduced by this deficiency. 

Evidently, it is crucial to handle the evidences with high conflict. In the last few years, many 
researchers have carried out comprehensive research and have applied a series of modifica- 
tions to the conventional evidence combination rule [18-24]. In general, the existing methods 
can be divided into two kinds of solutions: revisal of the combination rule and revisal of the 
evidences, for the problem of high level conflict evidence fusion. Proponents of the first argu- 
ment present that illogical results are caused by inappropriate distribution of the conflict infor- 
mation. Therefore, the modified methods based on the revisal of the combination rule mainly 
focus on altering the assignment of conflict information [2, 18, 21, 25-27]. Among them, solu- 
tions of the transferable belief model (TBM) and Dezert-Smarandache theory (DSmT) are 
more popular. The TBM develops a method to transfer the basic belief assignments (BBAs) to 
probabilities, but the method only can be used in closed world [25, 28]. DSmT extends the 
assignment universe of BBAs from a power set to a super power set which is more thorough 
and complete, and corresponding combination models and rules are developed as well [27]. 
Nonetheless, the modified combination rules have limitations in some situation. For example, 
most of the modified combination rules are not commutative and associative and are time 
consuming when dealing with a large amount of evidences. 

The other modification, revisal of the evidences, preprocesses conflict evidences before 
combination process. The favorable mathematical properties of the D-S combination rule are 
reserved in the improvement as they do not change the D-S combination rule. Many related 
work have been proposed to support this modification method [22-24, 29-32]. Murphy [29] 
generates a new evidence by averaging N evidences with equal weights and then combine it 
with N-1 times. Based on this idea, Deng [22] proposes a weighted averaging method to obtain 
the new evidence. Besides the weighted averaging method, the discounting method also plays 
an important role in preprocessing conflict evidences. The weighting factor of both the 
weighted average method and discounting method can be identified by evidence distance, 
which is usually used to describe the conflict or dissimilarity [33-35]. Liu [35] argues that the 
conflict coefficient k in the evidence theory is inadequate to reflect the degree of conflict and 
dissimilarity between evidences, and he utilizes a two-dimension cell <evidence conflict, evi- 
dence distance> to measure the dissimilarity between evidences. The cell is indeed more com- 
prehensive and adequate than the single coefficient k when describing the dissimilarity, but it 
also has intrinsic shortcomings in practical situation. For instance, the conflict tolerance 
threshold £ is largely subjective and depends on the perception of a decision maker. In addi- 
tion, the cell is not syncretized in the combination rule. A dissimilarity measure is proposed 
on the basis of Hamacher T-conorm fusion rules given by Liu, who considers not only the evi- 
dence conflict and distance but also combines it in the combination rule as a discount [36]. 
Nevertheless, there are twofold limitations associated with the mathematical modeling. First, 
since the conflict factors only use the maximal subjective probability of the BBAs, it cannot 
solve the situation related to the propositions with equal belief values, which are investigated 
in Section 3. Second, combining the evidence one by one has a low convergence rate. 

Besides evidence conflict and distance, the evidence volume is another criterion to measure 
the importance of an evidence [37]. If an evidence has more information, it should have a 
greater impact on the final aggregated result. Deng entropy [38], as a generalization of Shan- 
non entropy, can measure the evidence information volume under the framework of D-S evi- 
dence theory. In this paper, Deng entropy and modified dissimilarity measure are used to 
form a new weighting factor. Then the new combination rule of evidence is carried out based 
on the new weighting factor, which has improved the versatility and has a fast convergence 
rate. 
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This paper is organized as follows. Section 2 describes some basic concepts related to the 
D-S evidence theory and dissimilarity measure. Section 3 presents problems of existing conflict 
coefficients, especially the limitations of Liu’s method. Section 4 investigates the new weighting 
factor of modified dissimilarity and Deng entropy, and some examples and analysis are pre- 
sented to show the superiority and effectiveness of proposed method. In Section 5, the pro- 
posed method is used in a real-life application of the identification of water pollution sources. 
Finally, conclusions are drawn in Section 6. 


Preliminaries 
2.1 Basics of D-S evidence theory 


Definition 1. Suppose O be a nonempty finite set of mutually exclusive alternatives and defined 
as frame of discernment. Set of all the possible subsets of O, denoted by 2°, is called power set. 
The mapping m: 2° — [0,1] is defined as the basic belief assignment (BBA) (also known as 
basic probability assignment, BPA) [39, 40]. The BBA satisfies 


Xacom(A) =1 (1) 


m(0) =0 (2) 


where m(A) reflects the strength of each of evidence support for the proposition A in the frame 
of discernment, and @ denotes the empty set of O. A is called the focal element, if m(A) > 0. 

Definition 2. The belief function Bel(A) and plausibility function PI(A) from a BBA are 
defined as 


Bel(A) = Spcqtn(B) (3) 


PI(A) = SenaB) (4) 


where Bel(A) represents the amount of belief that definitely support A, and the PI(A) could be 
viewed as the amount of belief that potentially placed in A. 

Definition 3. Let m, and m, be two BBAs defined on the same frame O. D-S evidence theory 
combination rule is expressed as 





m(A) = 1-k (5) 


Eocam (BIMO yg 
0 A=0 
with 
k = X no-m (B)m,(C) (6) 
where k is named as conflict coefficient to measure the degree of conflict between two BBAs. 
The combination rule is out of work when k = 1. 


Zadeh [17] presents a famous example that the D-S combination rule will produce an unex- 
pected result. Suppose a frame is O = {A,B,C} and two BBAs are given as 


m,:m,(A) = 0.99,m,(B) = 0.01 
m, : m,(B) = 0.01,m,(C) = 0.99 


by the D-S combination rule, the aggregated result is k = 0.9999, m(A) = m(C) = 0 and m(B) = 
1, which is obviously counter-intuitive and unreasonable. 
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2.2 Jousselme distance 


Jousselme distance [32], considering both the mass and cardinality of focal elements of each 
BBA, is commonly used as the measure of dissimilarity. 

Definition 4. Let m, and m, be two BBAs on the same frame O, containing N mutually 
exclusive and exhaustive propositions. The Jousselme distance between m; and m are defined 
as 





dm = ¥/0.5 + (lmi? + ljm? — 20m, m,)) 4) 


Img 
where ||m:||? = (m,m), ||m]|? = (mm) and (m,m) is given by 


|A,NB,| 
|A; UB| 





(m,,m,) = 02,07 ,m, (A,)m,(B,) (8) 


with A; and B; are the elements of the power set 2°. |A; N B;| and |A; U B;| denote the cardinality 
intersection set and union set of A; and B}. 


2.3 Probabilistic-based distance 


Since the probalilistic transformation has an ability to convert a BBA from the focal elements 
into a probability measure of distinct atomic, it provides a probabilistic-based distance to mea- 
sure the dissimilarity of two evidences [41]. 

Definition 5. Let m be a BBA on a frame O, and the probabilistic expression of a singleton 
element B in O could be obtained by pignistic probability function 
m(A) 


TAL (9) 


BetP,,(B) = J 4e09 aca 


where |A] is the cardinality of proposition A. If |A| = 1, then B = A and BetP(B) = BetP(A) = m 
(A). 

Definition 6. Let m, and m, be two BBAs on the same frame O and let BetP,, and BetP,,, be 
the results of pignistic probability transformation of m, and mp, the probabilistic-based dis- 
tance difBetP” is defined as 


difBetP™ = massco(|BetP,, (A) — BetP,,, (A)|) (10) 


and the Murkowski distance [36] proposed by Liu is defined as 
distP = J 4,co0-5 * (|BetP,, (A;) — BetP,,,, (A;)|) (11) 


2.4 Combinatorial dissimilarity measure 


Some compound dissimilarity measures are presented based on the conflict coefficient, evi- 
dence distance and probabilistic-based distance. 

Definition 7. Let mı and m, be two BBAs on the same frame O, and a combinatorial dissimi- 
larity measure based on the conflict coefficient kr and probabilistic-based distance difBetP”? 


is defined as 
fmt = (kri difBetP™) (12) 


m; and mz are in conflict, iff both k”? > £ and difBetP > £. £ € [0,1] denotes the threshold 
of conflict tolerance, and identified according to different applications [35]. 





PLOS ONE | https://doi.org/10.1371/journal.pone.0177695 May 25, 2017 4/20 


-PLOS | ONE 


A new weighting factor in combining belief function 





Definition 8. Let m, and m, be two BBAs on the same frame O, and a combinatorial dissimi- 
larity measure [42] based on the conflict coefficient k”? and Jousselme distance qn, is defined 
as 

1 m2 mə 
kı = 3 x (kpi + dyp) (13) 

Definition 9. Let m, and m, be two BBAs on the same frame O and let BetP,, and BetP,,, be 

the results of pignistic probability transformation of m, and mz. Then a combinatorial dissimi- 


larity measure based on the Hamacher T-conorm fusion rules [43] is defined as 


ssa distP™ + ConfP™ 14 
ismP™ &T(distP”? , ConfP”™) = m m; 
m=! (distP , ConfPi) 1 + distPr? - ConfPh: = 





where ConfP'? denotes the conflict coefficient based on the pignistic probability, 
0, if BetP(Xm*) N BetP(Xr*) #0 


ConfP™ = ; 
BetP(Xi"*) - BetP(X*), otherwise 


my 


where BetP(Xi") = arg max,-6 


BetP,,,(x),i= 1,2 


2.5 Deng entropy 


Deng entropy, as a generalization of Shannon entropy, provide a solution to measure the infor- 
mation volume of a BBA. It is observed that the Deng entropy and Shannon entropy corre- 
spond to an uncertain degree of measurement [38]. 

Definition 10. Let m be a BBA on the frame O and the Deng entropy of m is defined as 


m(A,) 
9l4il _ 





E, = —diim(A,)log (16) 
where A; is a proposition in BBA m, and |A;| is the cardinality of A;. The Deng entropy will 
become identical to Shannon entropy if |A;| = 1, that is 


E, = —Sym(A)log SA) = -5 m(A,logrn(A,) (17) 





Limitations of exiting dissimilarity measurements between BBAs 


Example 1. Let mı, m, and m; be three BBAs on the same frame © with four propositions O = 
{A},A2,A3,A4}. The three BBAs are given as 


m, : m,(A,) = m,(A,) = m (43) = m,(A,) = 0.25 
m, : m,(A,) = m,(A,) = m,(A3) = m,(A,) = 0.25 


m, : m,(A,) = m,(A,) = m,(A;) = 1/3 


we can get the conflict coefficients kp? = 0.75 and ky = 0.67 by using Eq (6) between the 
BBAs. The result shows that the degree of conflict between m, and m, is bigger than the degree 
of conflict between m, and m3, and they both are in relative high conflict. In fact, there is no 
conflict intuitively between m, and m, because they are the same. By using the Eq (10) and Eq 
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(12), we can get the difBetP = 0 and cf(m,,mz) = (0.75,0), which illustrates that m, and m3 
are consistent and measurement of coefficient k cannot measure the degree of conflict between 
the evidences in this situation. Although the combined measurement implies that the D-S 
combination rule should be used, it cannot conclude that how much the error will be conduct 
by using the combination rule. Therefore, the combination rule has a limitation in terms of 
providing an explicit expression and cannot be used directly in the combination rule. 

Example 2. Let m, and m, be two BBAs on the same frame © = {A),Ay,. . .Az,}, such that 


m,: m,(A,) = m,(A,) =--- = m,(A,) z 


1 
m, : m,(A,.1) = m (Ana) = = M(An) = z 


It is obvious that the m, and m, are totally contrary to each other as they support the different 
propositions. The different dissimilarities between m, and mz are displayed in Fig 1. 

From Fig 1, it is evident that the values of the d;, kg, and difBetP are 1, when n = 1, which 
are intuitive. But when n > 1, the values of d; and difBetP tend to 0, and kg tends to 0.6, in- 
dicating that m, and m, are getting closer and less conflict with the increase of n, which are 
counter-intuitive and abnormal. Only the DismP keeps 1 with the increase with n, meaning 
that the m, and m, are totally in disagreement with each other. Therefore, dy, kg, and difBetP 
cannot be used as measurement of the dissimilarity between BBAs in this example. 














1 |= Å, A A O A A TAN FAN A, A A, A A A A A Ey LA A A\ 
X —*— dj —4— ky —=— difBetP —4— DismP 
0.8 F Q 4 
> aN 
V $ i 
ji N - S Z A A 
0.6 F Tp 4 
AY 
0.4 F J 
IN 
N 
0.2 F N 
> al 
St = HS = 5 = h 








0 5 10 15 20 
|9/2|=n 
Fig 1. Different dissimilarity measurements. 


https://doi.org/10.1371/journal.pone.0177695.g001 
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Since DismP considers not only the distance but also the conflict between BBAs, the mea- 
surement based on the Hamacher T-conorm fusion rules provides a general method of the dis- 
similarity. However, it has dificiency as shown in Example 3. 

Example 3. Let m, and m, be two BBAs on the same frame of discernment O = {A4,A2 . 
Ao}. For notation conciseness 1, 2, and so forth have been used to denote A4,A2, and so forth 
in the frame. The two pairs of BBAs are shown as 

lst Pair: 


m, : m, (2,3,4) = 0.05, m, (7) = 0.05, m, (O) = 0.1, m, (4) = 0.8 


My, m,(1, 2,3, 4,5) =1 


2nd Pair: 
m, : m,(2,3,4) = 0.05, m,(7) = 0.05, m,(@) = 0.1, m,(A) = 0.8 


m, : m,(1,2,3,4,5) = 0.5, m,(6,7,8,9,10) = 0.5 


where the A is a subset of O. This example considers 20 cases of the subset A, which increases 
by adding a new element at each case from A = {1} to A = {1,2,. . .,20}. The comparison of the 
dissimilarity measurements between m, and m, of the two pairs are shown in Figs 2 and 3 
respectively. 

From Figs 2 and 3, it can be seen that the distP and DismP are very close and follow the 
same trend, since the DismP are mainly decided by the distP and ConfP. As calculated in Eq 
(15), the maximal pignistic probabilities in both m, and m, always have intersection. So ConfP 
is small when the A increases from {1} to {1,2,. . .,20}. However, it is quite distinct situation of 
the two pairs of BBAs. In the 1st pair, both BBAs distribute their major belief to the same ele- 
ments when the cases from 1 to 6, which cause that the ConfP keeps 0. This is reasonable as the 
Mp only has one focal element m,(1,2,3,4,5) = 1 which corresponds to classical conflict coeffi- 
cient k. In the 2nd pair, m, has two equal focal elements m(1,2,3,4,5) = 0.5 and m2(6,7,8,9,10) = 
0.5. As the case from 1 to 5, there should be a notable dissimilarity between m, and m2, which 
is shown as k in Fig 3. Nevertheless, the pignistic probability transformation divides the belief 
equally to each single proposition as BetP,,, (1) = BetP,,, (2) =--- = BetP,,,(10) = 0.1, indi- 
cating that the ConfP considers the dissimilarity as 0. Therefore, the dissimilarity measures of 
ConfP and DismP are illogical in this situation. Although the classical conflict coefficient k could 
depicts the dissimilarity from cases 1 to 5, it cannot reflect the variety of divergence degree as 
the case increases. Neither does the difBetP. 


Combining belief function with a new weighting factor 
4.1 A modified dissimilarity measure 


In this section, a modified dissimilarity measure is proposed which is based on the Hamacher 
T-conorm fusion rules to describe the dissimilarity between BBAs. The dissimilarity measure- 
ment based on Hamacher T-conorm rules satisfy two important properties of commutativity 
and monotonicity. The commutativity could ensure that the dissimilarity matrix is symmetri- 
cal and no matter the fusion order of two evidences is, their dissimilarity is coincident. The 
monotonicity provides that dissimilarity measurement has single variation trend in a specific 
interval, which is easy to compare the dissimilarity between evidences. 
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Fig 2. Comparison of dissimilarity measures of the 1st pair evidence. 


https://doi.org/10.1371/journal.pone.0177695.g002 





Definition 11. Let m, and m, be two BBAs on the same frame O. The modified dissimilarity 
measure is defined as 


. m2 a ‘ m p.m, distP + as 
MDismPr 7 (distPm ; km) = T+ dith KE (18) 


where kr is the classical conflict coefficient. 


Kn = Xana- (A;)m,(A;) (19) 


The modified dissimilarity still satisfies the basic properties of commutativity and 
monotonicity: 
(1) Commutativity: 


MDismP(x,y) = MDismP(y, x) (20) 
(2) Monotonicity: 


0 < MDismP(x, y) < MDismP(x ,y) < MDismP(x,y) <1 (21) 





whereQ<x<x <landO<y<y <1. 
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Fig 3. Comparison of dissimilarity measures of the 2nd pair evidence. 





https://doi.org/10.1371/journal.pone.0177695.g003 


The modified dissimilarity measurement consists of a distance coefficient and a conflict 
coefficient between two evidences. As both the distance coefficient and the conflict coefficient 
lie in [0,1], the modified measurement is larger than its either components. The evidences 
have large distance and high conflict with the majority of other evidences would have a larger 
dissimilarity measurement and vice versa. 

It is obvious that the modified dissimilarity measure replaces the conflict coefficient 
ConfP 7? with kp. The ConfP*? implies that the main conflict results from discordant proposi- 
tions which are strongly supported by two BBAs respectively. However, the ConfP*? cannot 
handle the situation that one BBA has several propositions with equal belief. As we see it, the 
conflict coefficient should involve all conflicts existed between BBAs no matter how small the 
extent of conflicts is. Furthermore, the MDismP not only maintains good features but also 
makes up for shortcomings of DismP. 

Example 4. Considering two pairs of BBAs from Example 3 with the proposed dissimilarity 
measure of MDismP, the results are plotted in Figs 4 and 5. 

For the results of the 1st pair illustrated in Fig 4, the distP, DismP and MDismpP are identical. 
The lines show a variation tendency from a high dissimilarity when A = {1} to the minimum 
dissimilarity when A = {1,2,3,4,5} and increase again as A includes more elements. This is 
because the m only has one proposition and when A = {1,2,3,4,5}, the propositions with the 
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Fig 4. Comparison of dissimilarity of the 1st pair evidence. 





https://doi.org/10.1371/journal.pone.0177695.g004 


maximum belief of two BBAs are accordant. In Fig 5, when m, has two propositions with 
equal belief value, the results are different. The MDismP has a bigger value than the distP and 
DismP when A from {1} to {1,2,3,4,5}. The dissimilarity value of MDismP is near 0.8 before 
cases 6, which means the two BBAs are incompatible with each other. But the dissimilarity val- 
ues of distP and DismP are less than 0.6, which seems unreasonable. Based on the analysis of 
the above examples, a conclusion can be drawn that the modified dissimilarity measure 
MDismP can efficiently reflects the degree of dissimilarity between BBAs. 


4.2 Weighting factors 


In this section, we propose a novel method to determine the weighting factors among BBAs 
based on the modified dissimilarity measure and Deng entropy. 

The weight determinations are based on the principle that if an evidence is supported by 
greater number of evidences, this piece of evidence should be more important and have large 
effect on the final combination results. Moreover, if an evidence has considerable information, 
it also should be weighted more [37]. 

Suppose N evidences {m1,mp,. . my} are in the same frame of discernment © and the 
weight of each evidence is made up of the degree of similarity and information volume. The 
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Fig 5. Comparison of dissimilarity of the 2nd pair evidence. 





https://doi.org/10.1371/journal.pone.0177695.g005 


similarity degree Sim,, of m; and m; is defined as 


Sim, ; = 1 — MDismP(m,, m;) (22) 


the mutual similarity degree matrix Syxy is then defined as 


1 Sim,, +++ Sim, y 
Sim, ı 1 +++ Sim, y 
Syxw = T . . . (23) 
Simy, Simy, ` 1 


The Sy is symmetrical and Sim;; = Sim;; means that the similarity between two evidences fulfills 
the commutativity. The diagonal element is 1 means that an evidence is totally similar with itself. 
The similarity degree matrix helps give an insight into the agreement between evidences, and the 
weighting factor of dissimilarity could be obtained based on the similarity degree matrix. 





PLOS ONE | https://doi.org/10.1371/journal.pone.0177695 May 25, 2017 11/20 


-PLOS | ONE 


A new weighting factor in combining belief function 





The weighting vector W“ of each evidence is associated with the eigen vector of the maximal 
positive eigen value Amav that is Amax * WS = Syy WS, The evidence with the largest weight 
is deemed to be the most important evidence and the weight of each evidence is revised as 

dis _ WwW? i 


= —_ i=(1,2,...,N 24 
fs aa W P= oN) (24) 


the information volume of each evidence is measured by Deng entropy and can be calculated by 
Eq (16). After Deng entropy of each evidence is processed, the weights of information volume 
of evidence are obtained by 


Deng — E; (m;) 


; = jaan E,(m) T Deo) (25) 


then the weight of each evidence based on the proposed method is defined as 


of bom 
0; aos (o® + oP) i= (1,2,...,N) (26) 
HO} i 





The final weighting factor is measured by the weighting factor of dissimilarity measurement 
and the weighting factor of evidence information. The two weighting factors describe the final 
weight from two aspects: the factor of dissimilarity measurement depicts the mutual degree of 
deviation of an evidence with other evidences; the factor of evidence information illustrates the 
relative amount of information of an evidence compared to others. The final weighting factor is 
more thorough than the other weighting (discounting) factors which just depend on the dissim- 
ilarity measurement. With the weighting factor of information volume, the new weighting fac- 
tor still hold the capability to avoid fusion error caused by the single information source failure. 


4.3 Combination of evidences 


The weighted average [29] and discounted [36] methods are two kinds of approaches that have 
been proposed for combining the evidences. The discounted method distributes the remaining 
mass value of the discounted mass to the universal set O, but this would result in more uncer- 
tainty. As the discounted methods need to combine the evidences one by one, the calculation 
will be time consuming and have a low convergence when a large amount of evidences need to 
be combined. The weighted average method would reinforce each other if the evidences are 
concordant and would weaken each other if the evidences are in conflict. The belief of proposi- 
tions after combination will remain distinct from each other. In addition, the weighted average 
method is easily computational and more reliable and rational. Hence, the weighted average 
method is applied in this article. 

The evidence generated by weighted average method is 


EWA(m) = S>,@,m,, i = (1,2,...,N) (27) 


For N original evidences, we should combine the new evidence EWA(m) for N — 1 times. In 
this section, an example of target recognition [36] is presented to show some behaviors of the 
existing method as well as the proposed method. 

Example 5. Let mı, m2, m3, m4 and ms be five BBAs on the same frame of discernment 0 = 
{A,,A2,A3} as shown in Table 1. The combination results obtained with the proposed methods 
are shown in Table 2. The convergence is shown in Table 3. mi means fusion of evidences m, 
mz. . M; that ism’ = m Bm, O... Qm.. 

As can be seen from Table 2, the D-S combination rule (without the discounting or 
weighted average process) concludes that the proposition A, is almost be regarded as the 
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Table 1. BBAs of five evidences. 
































m, m m3 m, | Ms 

A 0.8 0.4 0 0.3 | 0.45 
A 0.1 0.2 0.95 0.2 | 0.1 
As 0 0.1 0.05 0.25 0 
{A1,Ao} 0 0.3 0 0.2 0 

{A2,A3} 0 0 0 0 | 0.15 
o 0.1 0 0 0.05 | 0.3 





https://doi.org/10.1371/journal.pone.0177695.t001 


target. The result is unreasonable and counter-intuitive since majority of evidences distribute 
the major belief to proposition A, and just one evidence m; give its major belief to A2. Such 
unexpected behavior is solved by using the discounting and weighted average method to lessen 
the influence of the evidences which is dissimilar with the other evidences. The larger the dis- 
similarity of an evidence, the larger discount it will have. However, one also sees that the pro- 
cess of the proposed method as the dissimilarity measure to determine the weight of each 
evidence generates a more specific and reliable result than the process of discounting factors 
based on d; and DismP. In addition, the m? is larger than the m? of m(A,) of the proposed 
method, which is completely opposite of d; and DismP when the m; assign its major belief 

to A2. What caused this is that the proposed new weighting factor weakens m, twice by the 
weighting factor of dissimilarity and Deng entropy. For dy and DismP, the discount factor 
weakens m; once, which overcomes the defect raise by the classical combination rule. Besides, 
the evidence m; has less information when compared to other evidences. So, weighting factor 
of Deng entropy further weaken the evidence m; on the basis of the weighting factor of the dis- 
similarity factor. The larger m? of m(Aj) of the proposed method reflects the proposed new 
weighting factor has a great capability of anti-interference. Furthermore, the proposed method 
has a better performance of convergence than the other methods. As can be seen from Table 3, 
the series results of proposed method achieve the belief level of other methods before the 


Table 2. Comparison results of different methods. 

















mi mi mi mi 
D- S[36] m(A,) = 0.8451 m(A3) = 0.9948 m(A}) = 0.9965 m(A3) = 0.9971 
m(As) = 0.0986 m(As) = 0.0052 m(As) = 0.0035 m(As) = 0.0029 
m(As) = 0.0140 
m(Aj,2) = 0.0423 
du [36] m(Ay) = 0.7659 m(A;) = 0.6239 m(A;) = 0.6858 m(A;) = 0.7528 
m(Az) = 0.1166 m(As) = 0.2791 m(Ap) = 0.2645 m(As) = 0.2217 
m(Ag) = 0.0294 m(Ag) = 0.0252 m(Ag) = 0.0146 m(Ag) = 0.0096 
m(A; 2) = 0.0881 m(Ay 2) = 0.0718 m(Ay 2) = 0.0315 m(Ay 2) = 0.0159 
DismP [36] m(A;) = 0.7503 m(A,) = 0.7157 m(A,) = 0.7670 m(Ay) = 0.8254 
m(As) = 0.1196 m(Aps) = 0.1598 m(As) = 0.1655 m(As) = 0.1424 
m(Ag) = 0.0319 m(Ag) = 0.0308 m(Ag) = 0.0194 m(Ag) = 0.0120 
mA; 2) = 0.0957 mA, 2) = 0.0913 mA; 2) = 0.0477 mA, 2) =0.0198 
m(@) = 0.0025 m(@) = 0.0024 m(@) = 0.0004 m(As a) = 0.0002 
m(©) = 0.0002 
Propsed m(A,) = 0.7974 m(A,) = 0.8227 m(A;) = 0.8277 m(Ay) = 0.8651 
Method m(Az) = 0.1276 m(Az) = 0.1584 m(Az) = 0.1588 





m(Ag) = 0.0114 
m(A;,2) = 0.0610 
m(@) = 0.0026 





m(Ag) = 0.0019 
mA, 2) = 0.0169 
m(@) = 0.0001 





m(Ag) = 0.0036 
m(A: 2) = 0.0099 





m(Ap) = 0.1249 
m(Ag) = 0.0048 
mA, 2) = 0.0046 
m(Ap,3) = 0.0004 
m(6) = 0.0001 





https://doi.org/10.1371/journal.pone.0177695.t002 
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Table 3. The convergence of the proposed method. 


Fusion time n 


3 


m; mi mi 
n=1 m(A,) = 0.7974 m(A,) = 0.7160 m(A,) = 0.6356 m(A,) = 0.6158 
m(A2) = 0.1276 m(Az) = 0.2149 m(Ag) = 0.2424 m(A2) = 0.2247 
m(Ag) = 0.0114 m(Ag) = 0.0115 m(Ag) = 0.0442 m(Ag) = 0.0555 
m(A1,2) = 0.0610 m(A; 2) = 0.0553 m(Aj,2) = 0.0748 m(A1,2) = 0.0676 
m() = 0.0026 m(@) = 0.0023 m(@) = 0.0029 m(Ao.3) = 0.0169 
m(@) = 0.0195 
n=2 m(A;) = 0.8227 m(A,) = 0.7523 m(A;) = 0.7292 
m(As) = 0.1584 m(Az) = 0.2066 m(Az) = 0.2051 
m(Ag) = 0.0019 m(Ag) = 0.0131 m(Ag) = 0.0270 
mA; 2) = 0.0169 mA; 2) = 0.0279 m(A1,2) = 0.0297 
m(©) = 0.0001 m(©) = 0.0002 m(A23) = 0.0055 
m(©) = 0.0035 
n=3 m(A;) = 0.8277 m(A,) = 0.8086 
m(A2) = 0.1588 m(A2) = 0.1654 
m(Ag) = 0.0036 m(Ag) = 0.0118 
m(A; 2) = 0.0099 m(A; 2) = 0.0120 
m(Ao 3) = 0.0016 
m(©) = 0.0006 
n=4 


https://doi.org/10.1371/journal.pone.0177695.t003 


m(A,) = 0.8651 
m(Ao) = 0.1249 
m(Ag) = 0.0048 
m(A; 2) = 0.0046 
m(Ao.3) = 0.0004 
m(@) = 0.0001 


completion of the fusion process. For example, the three times fusion results of m? by the pro- 
posed method achieves 0.8 which achieves belief level to final result of DismP 0.8254, indicating 
that the proposed method provides faster convergence and is less computationally intensive. In 
the end, the great difference in m3, where almost all belief is given to Ap, is caused by that evi- 
dence m, may be especially sensitive to the typical characteristics of the kind of A2. The results 
in Table 2 show that the A, with the second largest belief that is much larger than other proposi- 
tions except A,. Therefore, A, could be seen as a potential identification results as well. 

This work provides a modified dissimilarity measure and evidence information volume 
measure to determine the weighting factors of evidences involved in the fusion process. The 
modified dissimilarity measure includes both the distances and all conflicts between evidences 
and the evidence information volume of each evidence is measured by Deng entropy. Com- 
pared with other methods, the proposed method gives a more specific and faster aggregated 
result. In addition, it is an efficient method in some decision-making applications. 


Identification of major land use activities contributing to river 
pollution: An application 


In this section, the proposed method is applied to a case study in identifying the major land 
use activities contributing to river pollution and the required data are collected by water qual- 
ity monitors. 

The Manahara River lies in the Kathmandu Valley, Nepal, with a watershed of 256 km’. 
The river originates from Manichud Lekh (ridge) at an elevation of 2352 m and has a length of 
30 km. The river originates from a pristine forested region and flows through forest, rural, 
semi-urban, and urban areas. The water quality was monitored monthly in seven different 
sites (Sites 1-7) of the river by sensors. The data of Sites 1 to 7, respectively Salinadi, Sankhu, 
Brahmakhel, Bode, Sinamangal, Imadol, and Chyasal are obtained from [44]. The land use and 
anthropogenic activities in transition of different sites are given in Table 4. 
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Table 5. Water quality of Manahara river in different sites. 


Table 4. Land use and river activities in site transition. 




















Site Land use and river activities 
1to2 A1°, A2 
2to3 A2, A3 
3to4 A2, A3, A4 
4to5 A2, A3, A6 
5to6 A2, A3, A5, A6 
6to7 A6 








aNote: A1: Forest, A2: Agriculture, A3: Bathing, washing & cleansing, A4: Rural settlement (sparse), A5: 
Industries, A6: Urban settlement (dense). 





https://doi.org/10.1371/journal.pone.0177695.t004 


The water quality of the river during the low flow months in 2006 is given in Table 5. Water 
qualities are good in upstream and bad in downstream regions and varied gradually from Site 
1 to 7. Each monitor records the amount of each chemical element in river when the water 
















































































£ Site DO”, BOD Free TA Cl, mg/L NO3-N, mg/L PO4 NH3 EC, uS/cm TDS, mg/L 
£ mg/L mg/L CO2, mg/L mg/L -P, mg/L -N, mg/L 
1 9.2 1.2 7.1 26 6.3 0.18 0.1 0.07 60.33 39.9 
> 2 9.2 1.3 9.2 39 6.7 0.32 0.2 0.2 71 47.1 
s 3 9.1 6.8 12.5 39 7.2 0.64 0.3 0.26 82 54.4 
S 4 8.5 14.9 17.8 67 9.0 0.72 0.4 0.61 94 62.4 
= 5 6.2 27.5 48.3 103 22.2 0.83 0.9 1.16 281.33 186.4 
6 2.2 154.9 106.8 352 57.6 2 4.2 4.42 925 613.0 
7 0 155.0 107.5 379 71.6 3.37 5.3 4.62 957.73 667.0 
1 9.6 1.2 6.2 25 7.5 0.19 0.1 0.08 58.33 38.6 
2 8.9 1.8 6.9 31 8.4 0.35 0.2 0.22 70 46.4 
5 3 7.9 8.7 13.4 40 9.0 0.7 0.3 0.28 81.67 54.1 
© 4 7.3 16.7 14.1 73 13.2 0.75 0.4 0.66 116.33 76.7 
= 5 5.8 36.9 32.6 151 38.7 0.88 1.1 1.21 405.33 267.5 
6 5.7 83.8 55.9 276 68.7 1.82 3.3 3.25 719.33 474.7 
7 2.5 85.3 59.5 277 78.5 3.26 3.4 8.84 796 474.7 
1 8.6 1.8 8.5 32 6.2 0.18 0.1 0.07 49.67 32.8 
2 7.2 2.8 8.7 33 7.4 0.25 0.2 0.22 75.33 49.9 
z 3 6.8 9.8 9.2 37 8.6 0.6 0.2 0.24 78 51.8 
= 4 6.5 17.8 9.3 41 11.8 0.63 0.4 0.39 91 60.3 
5 4.3 39.3 17.3 63 26.5 0.79 1.3 1.32 192.67 127.3 
6 3.3 113.6 37.1 167 43.2 1.79 2.9 2.86 473 278.3 
7 2.0 149.2 77.0 279 48.4 3.09 3.2 3.22 544.67 359.0 
1 7.2 2.16 8.6 32 9.3 0.2 0.1 0.07 58.33 38.4 
2 6.7 3.8 13.4 33 9.7 0.26 0.1 0.22 97.33 64.1 
= 3 6.2 10.5 13.9 41 10.9 0.62 0.3 0.25 114.33 75.3 
= 4 6.0 18.4 13.9 41 13.3 0.67 0.4 0.55 122.33 80.6 
5 2.8 56.2 13.9 62 17.4 0.75 1.2 1.15 181.33 119.5 
6 2.3 119.0 24.9 130 25.0 2.24 1.8 3.04 296 195.0 
7 1.8 167.3 25.2 139 29.3 3.83 1.8 3.1 389.33 256.6 






































aNote: DO: Dissolved oxygen, BOD: Biochemical Oxygen Demand, TA: Total Alkalinity, Cl: Chloride, EC: Electrical conductivity, TDS: Total Dissolved Solids. 





https://doi.org/10.1371/journal.pone.0177695.t005 
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stream flow past each site. The actual variation amount of chemical element in each site should 
only consider the difference with the previous site. The chemical elements criteria in Table 5 
could be divided into benefit criteria and cost criteria. 

The change in water quality from one site to next site is normalized by Eqs (28) and (29). 
For a benefit criterion, 
X,- a 


N, = Xmax — cn (28) 


min 


where )~_,N, = 1 and X”®™ is the maximum value and X”"” is the minimum value of the cri- 


terion. For a cost criterion, 


X,,, — X, 
N, E = = (29) 
xXmax — X 


min ; 


where `; N, = 1 and X”* is the maximum value and X”"” is the minimum value of the 
criterion. 

For example, the amount of DO (mg/L) in February is 9.2, 2.2 and 0 record in site 1, 6 and 
7 respectively. DO (mg/L) is a benefit criterion and the normalization of site 6-7 is as Eq (30) 
shows. 





X,-X, 22-0 
N, = a 0.239 (30) 
The criteria, except the DO, are all cost criteria. The normalized chemical criteria of February 
are shown in Table 6. 

In Table 6, each chemical criterion is seen as an evidence. Therefore, there are ten evidences 
need to be fused to identify which site causes the major pollution. According to the proposed 
method, the results of the river water quality monitoring and identification of February are 
given in Table 7. 

From the Table 7, it shows that the A6 urban settlement (dense) is the major cause for the 
river pollution. All the fusion results are listed in Table 8. 

Table 8 shows that the major source of water pollution in all months are related to A6, i.e. 
urban settlement (dense). This result is true and reasonable as the adjacent area of Site 6 and 7 
is A6 that discharged untreated sewage directly to those sites [45, 46], that have the worst 
water quality with very high BOD, nutrients (NO3-N and PO4-P), NH3-N, EC, and TDS with 
very low dissolved oxygen. Untreated sewage has high BOD, nutrients (NO3-N and PO4-P), 
NH3-N, EC, and TDS with very low or almost zero dissolved oxygen [47]. However, the com- 
bination (A2, A3, & A6) in Site 4 to 5 and the combination (A2, A3, A5, & A6) in Site 5 to 6 
also have urban settlement (A6), which contributed insignificantly because urban settlement 


Table 6. Normalized chemical criteria in different sites of February. 





















































£ BOD Free TA Cl, mg/L NO3-N, mg/L PO4 NH3 EC, uS/cm TDS, mg/L 
8 mg/L | CO2,mg/L | mg/L -P,mg/L | -N, mg/L 
0.000 0.021 0.037 0.006 0.044 0.019 0.029 0.012 0.011 
S 0.036 0.033 0.000 0.008 0.100 0.019 0.013 0.012 0.012 
S 0.053 0.053 0.079 0.027 0.025 0.019 0.077 0.013 0.013 
S 0.082 0.303 0.102 0.201 0.034 0.096 0.121 0.209 0.198 
. 0.828 0.583 0.705 0.543 0.367 0.635 0.716 0.717 0.680 
0.000 0.007 0.076 0.215 0.429 0.212 0.044 0.036 0.086 





https://doi.org/10.1371/journal.pone.0177695.t006 
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Table 7. Fusion results of February by using the proposed method. 








Land use and A2 A6 A1 A2 A2 A3 A2, A3, A4 A2, A3, A6 A2, A3, A5, A6 
river activities 
Fusion Result 0.057 0.615 0.000 0.151 0.000 0.134 0.041 

of February 























https://doi.org/10.1371/journal.pone.0177695.t007 


covered only a smaller land cover, is far from the river, and sewage outfall is absent in Site 4 
and lower in Site 5. 

The combination (A2, A3, A5, & A6) also has an industry which contributed insignificantly 
because there is only one beverage industry and its wastewater is diluted by the river flow and/ 
or the sampling time would have been different than the discharge of concentrated industrial 
wastewater. In addition, the contribution of organic load by agriculture is much lower than 
that of sewage outfall [48]. Due to this, agriculture (A2) contribution has resulted in an insig- 
nificant pollution. Furthermore, the organic load contribution by forest (A1) is lower [49, 50], 
due to which the combination (A2 & A3) has insignificant contribution too. 

In the four months from February to May, the results are similar. This is because the rivers 
have low flow in these months with February being winter and March to May being the pre- 
monsoon season [47]. The low flow in a river results in a high concentration of pollutants due 
to the lack of dilution. Therefore, the results of the application of the proposed method in iden- 
tifying water polluting land use activities match the actual situation. 


Conclusion 


Solving the problem of conflicting information fusion is still under continuous discussion. In 
this paper, a combination approach of evidences with a new weighting factor has been pro- 
posed based on a modified dissimilarity measure and Deng entropy between BBAs. The new 
weighting factor contains two part: weight of similarity and weight of entropy. The weight of 
similarity describes the degree of mutual support between BBAs. The larger the dissimilarity 
with others, the less the weight of similarity of a BBA. The weight of entropy describes the 
information volume of each BBA. If a BBA has a larger entropy value, meaning it contains 
more information, the weight of entropy of that BBA would be larger. 

After analyzing the features and limitations of the existed dissimilarity measures, a modified 
dissimilarity measure based on the Hamacher T-conorm fusion rules mixing the probabilistic- 
based distances with the all conflict between BBAs has been developed. Also, the weight of sim- 
ilarity is determined by dissimilarity measures. The information volume of a BBA is obtained 
by Deng entropy and is used in the determination of the weight of entropy. 


Table 8. Results of the application of the proposed method. 





























Land use and river activities Fusion Result l 
Feb Mar Apr May 
A2 0.057 0.074 0.171 0.179 
A6 0.615 0.770 0.619 0.606 
A1 & A2 0.000 0.000 0.000 0.000 
A2 & A3 0.151 0.114 0.112 0.142 
A2, A3, & A4 0.000 0.000 0.000 0.000 
A2, A3, & A6 0.134 0.038 0.087 0.069 
A2, A3, A5, & A6 0.041 0.001 0.005 0.002 














https://doi.org/10.1371/journal.pone.0177695.t008 
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The new weighting factor based on the modified dissimilarity and Deng entropy is pre- 
sented and applied in the weighted average combining belief function. Several numerical 
examples analysis show that the proposed method not only obtains a more reasonable and spe- 
cific result but also has a faster convergence rate. A real application of determining the major 
land use activities contributing to river pollution is implemented by the proposed method. 
Result shows that the urban settlement (dense) is the major source of water pollution. This 
new approach can be of great interest for decision makers in devising strategies to control 
water pollution and environmental management. 
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